Reconfigurable and Self-optimizing Multicore Architectures
نویسنده
چکیده
For the last 30 years, the microprocessor industry has relied on instruction-level par-allelism (ILP) and aggressive clock scaling to translate Moore's Law into exponential performance growth. Over this time frame, system performance has increased by a factor of 10, 000, enabling novel software applications and capabilities beyond the reach of earlier hardware platforms. Due to mounting challenges in power dissipation and system complexity, however, this long-standing approach to microprocessor performance scaling is no longer practical. The industry's initial response to the end of uniprocessor performance scaling is a paradigm shift towards Chip Multiprocessors (CMPs). Although a general consensus exists on the power, performance, and complexity advantages of CMPs, the way these systems will meet the intrinsically diverse requirements of the software products that will be running on them is much less clear. Both the initial adoption and the longevity of CMPs hinges on their ability to embrace a diverse and dynamically changing software ecosystem , consisting of software products with very different characteristics and in different stages of development. Unfortunately, this stands in sharp contrast to the rigid nature of existing CMP architectures, which are typically focussed on optimizing average or worst-case performance, and are statically tuned to perform well over a relatively narrow class of applications. This dissertation addresses the design of next-generation, flexible, efficient, and self-optimizing CMP architectures. Our work is structured in two phases. The first phase addresses flexibility in the execution substrate in terms of per-core performance and core count, a critical capability that is absent from existing CMP architectures. We propose core fusion, a reconfigurable chip multiprocessor (CMP) architecture where groups of fundamentally independent cores can dynamically morph into a larger CPU, or they can be used as distinct processing elements, as needed at runtime by applications. Core fusion gracefully accommodates software diversity and incremental parallelization in CMPs. It provides a single execution model across all configurations, requires no additional programming effort or specialized compiler support, maintains ISA compatibility, and leverages mature micro-architecture technology. The second phase of our work addresses flexibility and sophistication in the architectural control policies that manage scarce system resources. Efficiently utilizing off-chip DRAM bandwidth is a critical issue in designing cost-effective, high-performance CMP platforms. Conventional memory controllers deliver relatively low performance in part because they often employ fixed, rigid access scheduling policies designed for average-case application behavior. As a result, they cannot learn and optimize the long-term performance impact of …
منابع مشابه
مدل عملکردی تحلیلی FPGA برای پردازش با قابلیت پیکربندی مجدد
Optimizing FPGA architectures is one of the key challenges in digital design flow. Traditionally, FPGA designers make use of CAD tools for evaluating architectures in terms of the area, delay and power. Recently, analytical methods have been proposed to optimize the architectures faster and easier. A complete analytical power, area and delay model have received little attention to date. In addi...
متن کاملDesign of a novel congestion-aware communication mechanism for wireless NoC architecture in multicore systems
Hybrid Wireless Network-on-Chip (WNoC) architecture is emerged as a scalable communication structure to mitigate the deficits of traditional NOC architecture for the future Multi-core systems. The hybrid WNoC architecture provides energy efficient, high data rate and flexible communications for NoC architectures. In these architectures, each wireless router is shared by a set of processing core...
متن کاملA Virtual Layer for FPGA Based Parallel Systems (MP-SoCs)
Besides performance and time to market, robustness and reliability are important design targets for modern Systemson-Chip (SoCs). Despite these features the power consumption must be as low as possible. To meet these design goals parallel, flexible, and adaptive architectures are required [1]. Today, dynamically reconfigurable FPGAs are well suited to form a parallel architecture because they i...
متن کاملProgram Execution on Reconfigurable Multicore Architectures
Based on the two observations that diverse applications perform better on different multicore architectures, and that different phases of an application may have vastly different resource requirements, Pal et al. proposed a novel reconfigurable hardware approach for executing multithreaded programs. Instead of mapping a concurrent program to a fixed architecture, the architecture adaptively rec...
متن کاملMPEG RVC AVC Baseline Encoder Based on a Novel Iterative Methodology
With the emergence of new generations of multicore architectures, the need for efficient multimedia algorithm implementations has become critical. This paper describes a new methodology for efficient implementations of algorithms targeting reconfigurable architectures. The Reconfigurable Video Coding (RVC) standard aims to provide a framework allowing a dynamic development, implementation and a...
متن کاملMethods for Optimizing OpenCL Applications on Heterogeneous Multicore Architectures
Heterogeneous multicore architectures with CPU and add-on GPUs or streaming processors are now widely used in computer systems. These GPUs provide substantially more computation capability and memory bandwidth compared to traditional multi-cores. Also, because they are highly programmable, they provide the computational performance needed for realistic graphics rendering. Applications with gene...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008